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We report a measurement of the branching fraction of B^ —> decays using a data sample 

of 772 X 10®-Bi3 pairs, collected at the T(4S') resonance with the Belle detector at the KEKB 
asymmetric-energy e''"e“ collider. We reconstruct the accompanying B meson in a semileptonic 
decay and detect the recoiling B candidate in the decay channel B'^ —>■ r'^Vr- We obtain a branching 
fraction of B{B'^ —> r^Vr) = [1.25±0.28(stat.)±0.27(syst.)] x 10“^. This result is in good agreement 
with previous measurements and the expectation from the calculation based on the Standard Model. 

PACS numbers: 13.20.He, 14.40.Nd 


In the Standard Model (SM), the branching fraction of 
the purely leptonic decay [1] is given by 




G^mBrri^ 

Sir 



2 

/IlKhpTB, 


( 1 ) 

where Gp is the Fermi coupling constant, Vut the 
Cabibbo-Kobayashi-Maskawa matrix element, ms and 
rrir the masses of the B meson and the t lepton, re¬ 
spectively, tb the lifetime of the B meson, and /b the 
B-meson decay constant. The branching fraction de¬ 
pends strongly on the mass of the lepton due to helicity 
suppression, and thus B'^ -T- is expected to have 


the largest leptonic branching fraction of the B'^ meson 
and is the only decay of this kind for which there is ex¬ 
perimental evidence. All of the inputs of Eq. 0 are 
measured experimentally [2] except for /b, which is cal¬ 
culated in the framework of lattice quantum chromody¬ 
namics [3] . An estimate of the branching fraction, which 
uses a unitarity-constrained Vub value aggregated from 
other measurements and lattice calculations of Jb, gives 
B{B+ ^ T+Pr) = (0.75t°:J°) X 10-4 m. 

Physics beyond the SM, such as the presence of addi¬ 
tional charged Higgs bosons [3 [6], could interfere con¬ 
structively or destructively with the SM weak decay pro¬ 
cess. Measurements by the BaBar [71 [5] and Belle |5] 
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collaborations showed a slight disagreement with the 
SM expectation, but the most recent measurement by 
Belle [TU], using a hadronic tagging method, is in very 
good agreement. The current world average, —>■ 

T+Vr) = (1.14±0.27) X 10-4 [2], shows no sign of physics 
beyond the SM. 

The analysis described here contains the following im¬ 
provements over our previous measurement [^: an im¬ 
proved semileptonic tagging method; the reconstruction 
of an additional r decay channel; a newly optimized se¬ 
lection, including a dedicated suppression of background 
containing converted photons and a multivariate suppres¬ 
sion of continuum e+e” qq {q = u, d, s, c) background; 
the inclusion of a second variable, the momentum of the 
decay product of the t, in the final fit; and the usage of 
the full Belle data set, which contains almost 20% more 
data than in the previous analysis. 

The measurement is performed using the final Belle 
data sample consisting of an integrated luminosity of 
711fb-4, containing (772 ± 11) x lO^BB pairs, collected 
at the T (45”) resonance at the KEKB asymmetric-energy 
e+e” collider m- We use a smaller data sample with 
an integrated luminosity of 79 fb-^ taken at an energy 
60 MeV below the T (45') mass to study background from 
continuum ^ qq events. We generate Monte Carlo 

(MC) samples of signal and background events. We 
model the decays of unstable particles using the software 
package EvtGen HU and we simulate the detector re¬ 
sponse using GEANT3 HU. The simulated signal events 
are overlaid with beam-related background events that 
were recorded with a random trigger. 

The Belle detector is a large-solid-angle magnetic spec¬ 
trometer that consists of a silicon vertex detector (SVD), 
a 50-layer central drift chamber (GDC), an array of 
aerogel threshold Cherenkov counters (ACC), a barrel¬ 
like arrangement of time-of-flight scintillation counters 
(TOE), and an electromagnetic calorimeter (ECL) com¬ 
posed of CsI(Tl) crystals located inside a superconduct¬ 
ing solenoid coil that provides a 1.5 T magnetic held. 
An iron hux-return located outside the coil (KLM) is in¬ 
strumented to detect mesons and to identify muons. 
The detector is described in detail elsewhere HU- Two 
SVD conhgurations were used. A 2.0 cm beam pipe 
and a three-layer SVD were used for the hrst sample of 
152 X 10^BB pairs, while a 1.5 cm beam pipe, a four-layer 
SVD and a small-cell inner drift chamber were used to 
record the remaining 620 x 10^BB pairs HU - 

Since the detectable signature of a —>• de¬ 

cay is often only a single charged track and thus inad¬ 
equate for an efficient signal-background discrimination, 
we reconstruct the accompanying B meson (referred to as 
Stag) in the semileptonic decay channels B'^ —>■ 
and where i can be an electron or muon. 

The D*^ mesons are reconstructed through the decays 
£)*o and and the mesons through 

the decays —>■ 'it~^ 

AT^tt+tt-, 7r“'"7r-7r*^, Kgir^, KgK^K~, K^K~, 
and TT+Tr”. Neutral pions are reconstructed as tt® —>■ 77 


and Kg as Kg —>■ tt+tt”. 

To maximize the efficiency in identifying Stag candi¬ 
dates, loose selection criteria are applied throughout their 
reconstruction. Charged Hnal state particles are selected 
from well-measured tracks and are required to have a 
distance to the interaction point along (perpendicular 
to) the beam direction, denoted as dz (dr), of less than 
4 (2) cm. Photons used for the reconstruction of neutral 
pions are required to have an energy of at least 30 MeV 
and the invariant mass of the two-photon system (M^j) 
must satisfy — rriT^ol < 19 MeV/c^; this corresponds 
to a width of 3.2 standard deviations {a). The invariant 
mass of the two charged tracks that are used to form Kg 
candidates must lie within 30 MeV/c^ (4.5 (t) of the nom¬ 
inal Kg mass. The momenta of meson candidates 
are required to be below 2.5 GeV/c to reject mesons 
from e“''e“ —>■ cc events. 

All further selections related to the Stag candidate are 
performed by training multiple instances of a multivari¬ 
ate selection (MVS) method based on the NeuroBayes 
package HU- An MVS classiher is trained for each of the 
reconstructed decay channels, where a large sample of 
simulated B mesons that decay generically is used. The 
variables used in the training of the MVS related to the 
intermediate particles are identical to those used for the 
hadronic full-reconstruction method HU and the same hi¬ 
erarchical approach is used. The mass, momentum, and 
decay channel of the particle candidate, as well as the mo¬ 
menta, angles, and the output of the separately-trained 
MVS of daughter particles are used in the training, if ap¬ 
plicable, in addition to particle-specific information like 
the output of the particle identihcation. The training 
related to the Stag candidate is performed using the fol¬ 
lowing information, sorted by their discriminating power 
in descending order: the outputs of the MVS of the de¬ 
cay products; the mass of the meson candidate or the 
difference of the masses of the D*^ and the meson 
candidates, depending on the decay channel; the angle 
between the meson candidate and the lepton in the 

center-of-mass system of the T(45) (CM); the angle be¬ 
tween the Stag candidate and the meson candidate 

in the center-of-mass system of the Stag candidate; the 
distance of minimum approach between the decay 

vertex and the trajectory of the lepton; the decay channel 
of the meson; and the angle of the reconstructed 

Stag candidate with respect to the beam axis in the CM. 

The training variables were chosen to be uncorrelated 
with the cosine of the angle between the momentum of 
the S meson and the system, calculated under the 

assumption that only one massless particle is not recon¬ 
structed. It is given by 


2SbeamS£,(»7: “ 


cos 6 




‘^PbP*dMi^ 


( 2 ) 


where p*^ is the nominal S meson momentum calculated 
from the beam energy and the nominal mass, 
rriDMe, and are the energy, mass, and momen¬ 
tum of the system, respectively, and Abeam is the 
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energy of the beam (all in the CM). While the indepen¬ 
dence between the angle and the output of the MVS is 
not exploited in this analysis, it can be used in prin¬ 
ciple to produce signal-free sideband samples to study 
backgrounds and to extract the number of correctly re¬ 
constructed Stag candidates. It is also used later for a 
rehned selection since correctly reconstructed Stag can¬ 
didates have values between —1 and 1, while background 
events where the assumption of only one missing massless 
particle fails have a much larger range of values. Some 
Stag candidates are reconstructed partially because of a 
missing slow pion or soft photon and lie in a broader 
range that still peaks around the signal region. 

The Stag candidates are combined with S mesons re¬ 
constructed in the decay S+ —)• the latter are de¬ 
noted as Ssig. The r lepton is reconstructed as —>■ 

and p'^Vr, with —>■ tt+tt®. 

Since the neutrinos cannot be detected, the Sgig can¬ 
didate consists of a single charged track or a candi¬ 
date. The p'^ candidate is required to have an invariant 
mass within 195MeV/c^ of the nominal mass. The 
signal-side decay modes are separated based on parti¬ 
cle identification variables. The pion and kaon sepa¬ 
ration uses information from the ACC, TOF, and the 
dE/dx measurement in the CDC; the electron identi¬ 
fication is based on the same information in addition 
to the shape of the shower and the energy measure¬ 
ment in the ECL; and muon candidates are identihed 
using hits in the KLM matched to a charged track. The 
selections of the signal-side decay modes are mutually 
exclusive. The momentum of the signal-side particle 
(6+,/x+, 7r+, or /9+) in the CM (p*ig) must be in the range 
0.5 GeV/c < p* g < 2.4 GeV/c. 

The combination of a Stag and a Sgig candidate pro¬ 
vides an T(4S') candidate. The fact that the T(4S') is 
produced without any accompanying particles allows for 
a powerful selection: we reject events with additional 7r° 
candidates or charged tracks with \dz\ < 100 cm and 
\dr\ < 20 cm. We also perform a selection in the re¬ 
maining energy in the ECL (Secl)) defined as the sum 
of the energies of clusters in the ECL that are not as¬ 
sociated with final state particles of the reconstructed 
T(4S') candidate. To mitigate beam-induced background 
in the energy sum, only clusters satisfying minimum en¬ 
ergy thresholds of 50, 100, and 150 MeV are required for 
the barrel, forward, and backward end-cap calorimeter, 
respectively. Signal events peak near low values of Eecl 
as only photons from beam-related background and mis- 
reconstructed events contribute, while the background is 
distributed over a much wider range. We require Eecl 
to be smaller than 1.2 GeV. 

In the decay channel r'*' —)■ e'^Dri^e, a significant back¬ 
ground arises from events containing converted photons. 
To suppress this, we combine the electron used in the 
reconstruction of either Bgig or i3tag with every other 
oppositely-charged track in the event. Using the electron 
mass hypothesis for the second track, we require the in¬ 
variant mass of the electron-track pair to be greater than 


200MeV/c^ for any of the pairs. To suppress background 
from continuum events, we train another MVS with the 
following input variables: the angle of the Stag candi¬ 
date with respect to the beam direction in the CM; the 
angle between the thrust axis of the Stag candidate and 
the remaining tracks in the event in the CM; 16 modi¬ 
fied Fox-Wolfram moments |18) ; and the momentum flow 
in nine concentric cones around the thrust axis of the 
Stag candidate [T^ . The background contributions differ 
significantly between the r decay channels. Therefore, 
the requirements on the output of the two MVS (for the 
Stag and continuum suppression) depend on the r de¬ 
cay channel. This is also true for the requirement on 
cos0g it must be less than 1 in all channels and 

greater than —1.7, —1.9, —1.3, and —2.6 for the t de¬ 
cay channels to muon, electron, pion, and p, respectively. 
The selections related to the output of the two MVS clas¬ 
sifiers, cosBq the particle identification of the tt"*" 

from hadronic r decay channels, and are opti¬ 

mized using samples of simulated signal and background 
events to give the highest figure-of-merit Ns /V Ns + Nb , 
where Ns and Ng are the number of expected signal 
and background events in the region Eecl < 0.2 GeV, 
respectively, as estimated from simulation and assuming 
B{B+ T+Vr) = 0.75 X 10-4. 

The fraction of signal events having multiple candi¬ 
dates is 7%. In such events, we choose the candidate 
with the maximal value of the tag-side MVS classifier 
output. From MC simulation, we find that this method 
selects the correct candidate 70% of the time. The overall 
reconstruction and selection give a total reconstruction 
efficiency of e = (23.1 ± 0.1) x IQ-^, where the uncer¬ 
tainty is due to MC statistics only. It is described in 
detail in Table |T] The given efficiencies contain all rel¬ 
evant branching fractions, including the corresponding 
branching fractions of the r lepton. 

TABLE I. Reconstruction efficiency (in IQ-^) for each r de¬ 
cay mode, determined from MC and calibrated according to 
control sample studies. The uncertainty due to limited MC 
sample size is below 0.1 for all values. The row denotes the 
generated decay mode, and the colnmns represent the recon¬ 
structed final state. The off-diagonal entries reflect the cross¬ 
feed between channels. 


Final state 



TX^Vr 

TV'^TT^^Vt 

e^Veyr 

6.6 

0.1 

0.2 

0.1 


0.1 

4.7 

0.6 

0.2 


0 

0.1 

1.6 

0.5 

+ 

o 

0 

0.1 

1.4 

4.9 

11^ V-r 

0 

0 

0.2 

1.3 

Other 

0 

0 

0.1 

0.2 

All 

6.8 

5.1 

4.0 

7.2 

Total 


23.1 



To study possible differences between real and simu¬ 
lated data, we use samples where B^ig is reconstructed in 
the decays B~^ —>■ and U+ —)■ (further 
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denoted as double-tagged samples). The mesons 
are reconstructed as —>■ D^tt^ and and the 

meson as —>■ The meson candi¬ 

dates are selected based on their mass. For the D*^ 
meson candidates, additionally, the mass difference be¬ 
tween the D*^ and the meson candidates is used for 
selection. We apply the same four sets of different cri¬ 
teria on cos9g and the two MVS classifiers corre¬ 
sponding to the four r decay channels of the nominal 
analysis to each of the double-tagged S-decay samples. 
We measure the branching fractions of the Sgig decays, 
determine the ratios with respect to the world average 
values [2], and calculate weighted averages of the ratios 
of the B-decay channels. The reconstruction efficiency is 
found to be overestimated in MC simulation by a factor 
of 1.09 ± 0.09,1.08 ± 0.08,1.17 ± 0.22, and 1.02 ± 0.10 
for the r decay channels to muon, electron, pion, and 
p, respectively. The reconstruction efficiency is corrected 
based on this ratio, depending on the decay channel of 
the Stag and the t. 

To extract the number of reconstructed signal 
events, we perform an extended two-dimensional un¬ 
binned maximum-likelihood fit in and FIecl- We 
use smoothed histogram probability density functions 
(PDFs) [5D] obtained from MC simulation to describe the 
signal and background components arising from events 
containing a BB pair. We use the product of one¬ 
dimensional PDFs for all components except for the sig¬ 
nal in T'^ —> and r’*' —> p'^Pr- In these modes, there 

is a significant cross-feed from channels with additional 
undetected neutral pions, resulting in a correlation be¬ 
tween i?ECL and Pgjg that is taken into account by using 
two-dimensional histogram PDFs. The continuum back¬ 
ground, including e+e” qq {q = u,d,s,c), t+t“, and 
two-photon events, is described using the off-resonance 
data and is scaled according to the relative luminosities. 
Since the off-resonance data sample is very limited, its 
F'ecl distribution is described by a signal-mode-specific 
linear function. The slope of these functions is compat¬ 
ible with zero in all but the r’*' —>■ p'^Dr decay chan¬ 
nel, which shows a slope of 37 events per GeV. The 
uncertainties are 14, 6,16, and 14 events per GeV, corre¬ 
sponding to a relative uncertainty of 20,16, 25, and 13% 
on the number of events in the range Eecl < 0.2 GeV 
for the —)■ —>■ e'^i'rVe,T^ —t and 

> p'^Pr decay channel, respectively. The relative un¬ 
certainty on the normalization is about 10% due to the 
limited size of the data sample. 

The ratio of the normalizations of the background com¬ 
ponents is fixed in the fit based on the yields estimated 
from simulation and off-resonance data. We compare the 
data and MC distribution of the signal component in 
Eecl and using the double-tagged sample, which re¬ 
veals no significant difference. This is illustrated in two 
representative plots in Fig. To validate the p*^^ dis¬ 
tributions, we treat the lepton from the B decay as the 
T-decay product. We apply the same validation to other 
control samples and variables such as cosOgjyMi, the 


outputs of the MVS classifiers, and the missing energy in 
the event. 



0.5 1.0 1.5 2.0 


Pi (GeV/c) 

FIG. 1. Comparison of the data and MC distribution in the 
double-tagged sample after the efficiency correction. The 
orange boxes show MC and the black markers the data. 
The upper and lower plots show the Fecl distribution in 
the —>■ sample and the p*ig distribution in the 

5+ —>■ sample, respectively. Both sam¬ 
ples are selected corresponding to the —>■ decay 

channel. 

The following five parameters vary in our final fit to 
the data: B{B~^ and the normalization of the 

background in each of the four r decay channels. The 
relative signal yields in the r decay channels are fixed to 
the ratios of the reconstruction efficiencies. We obtain a 
total signal yield of Wig = 222 ± 50, and this results in 
a branching fraction of B{B'^ —)■ = (1.25 ± 0.28) x 

10“'*, where the uncertainties are statistical only. The 
signal yields and branching fractions, obtained from fits 
for each of the t decay modes separately, are given in 
Table im The results are consistent with a common value 
with a p value of 10%. The projections of the fitted 
distributions are shown in Fig. 

The list of systematic uncertainties is given in Ta¬ 
ble |III| The following systematic uncertainties are de¬ 
termined by varying the corresponding parameters by 
their uncertainty, repeating the fit and taking the dif¬ 
ference to the nominal fit result as the systematic un¬ 
certainty: the normalization and slope of the continuum 
background component, where the dominant uncertainty 
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FIG. 2. Distributions for (a) r"*" —> (b) —>■ e^i'rVe, 

(c) t'^ —>■ 7r’'"i>T, (d) r"*" —> and (e) the sum of them. 

The left and right columns show the distributions of Secl 
and p*ig projected in the region Secl < 0.2 GeV, respec¬ 
tively. The markers show the data distribution, the solid line 
the total fitted distribution, and the dashed line the signal 
component. The orange (red) filled distribution represents 
the BB (continuum) background. 


originates from the error on the slope; the signal recon¬ 
struction efficiency; the branching fractions of the dom¬ 
inant background decays peaking in the Eecl signal re¬ 
gion, e.g., B'^ —followed by —>• K^Ke or 

—>• KlKlKl', the correction of the tagging efficiency, 
obtained from the double-tagged samples and assumed to 
be 100% correlated among the four t decay modes; and 
the branching fractions of the r lepton. For branching 
fractions of D mesons with multiple mesons in the 


TABLE II. Signal yields and branching fractions, obtained 
from fits for the r decay modes separately and combined. 
Errors are statistical only. 

Decay mode 

Wig 

5(10-^) 


13±21 

0.34±0.55 


47±25 

0.90T0.47 

r"*" —>■ 

57±21 

1.82±0.68 

T'^ —>■ 

119±33 

2.16T0.60 

Gombined 

222±50 

1.25±0.28 


TABLE III. List of systematic uncertainties. 

Source 

Relative uncertainty (%) 

Continuum description 

14.1 

Signal reconstruction efficiency 

0.6 

Background branching fractions 

3.1 

Efficiency caiibration 

12.6 

r decay branching fractions 

0.2 

Histogram PDF shapes 

8.5 

Best candidate selection 

0.4 

Charged track reconstruction 

0.4 

reconstruction 

1.1 

Particle identification 

0.5 

Charged track veto 

1.9 

Number of BB pairs 

1.4 

Total 

21.2 


final state, we use the values for corresponding decays 
with Ks and take 50% of the value as the uncertainty. 

To estimate the effect of the uncertainty on the shape 
of the histogram PDFs due to the statistical uncertainty 
in the MC, the content of each bin is varied following a 
Poisson distribution with the initial value as the mean. 
This is repeated 1000 times and the standard deviation 
of the distribution of branching fractions is taken as sys¬ 
tematic uncertainty. For the systematic uncertainty re¬ 
lated to the best-candidate selection, we repeat the fit 
without applying this selection. The result is divided 
by the average multiplicity of 1.07 and compared to the 
nominal fit result. The uncertainties on the efficiency 
of the reconstruction of charged tracks and neutral pi¬ 
ous and on the efficiency of the particle identification 
have been estimated using high-statistics control sam¬ 
ples. The charged-track veto is tested using the Zl°7r+ 
double-tagged sample by comparing the number of addi¬ 
tional charged tracks in MC and data events. We find 
that it agrees well and so take the relative statistical un¬ 
certainty on the control sample as the systematic un¬ 
certainty. We also test an alternative description of the 
continuum background in Eecl by using a polynomial of 
second order but the deviation is well covered by the re¬ 
lated systematic uncertainty so we do not include it sep¬ 
arately. The quadratic sum of all contributions is 21.2%. 

We find evidence for t'^Vt decays with a signifi¬ 

cance of 3.8 (T, by convolving the likelihood profile with a 
Gaussian whose width is equal to the systematic uncer- 
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tainty. The significance is given by ■\/2 ln(£/£o), where 
£(£o) is the value of the likelihood function when the 
signal yield is allowed to vary (set to 0). 

In summary, we report the measurement of the branch¬ 
ing fraction of decays using a sample of 

772 X 10® BB pairs, which we analyze with the semilep- 
tonic tagging method. Our result is 

B{B+ T+i^r) = [1.25±0.28(stat.)±0.27(syst.)] x 10-'‘ 

with a significance of 3.8 a. This result is consistent 
with our previous measurement based on the semilep- 
tonic tagging method of B{B^ —)• r+i/,-) = [1.54 ± 
0.38(stat.) ± 0.37(syst.)] x 10“^ [S] and supersedes it. A 
combination with the recent Belle measurement based on 
the hadronic tagging method [10] of [0.72lg;25(stat.) ± 
0.11(syst.)] X lO”'^, taking into account all correlated 
systematic uncertainties, gives a branching fraction of 
B{B+ T+Vr) = [0.91±0.19(stat.)±0.11(syst.)] x 
with a combined significance of 4.6 a. This value is con¬ 
sistent with the SM expectation based on a fit using in¬ 
dependent experimental input |4]. 
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